Exploration via State influence Modeling
نویسندگان
چکیده
This paper studies the challenging problem of reinforcement learning (RL) in hard exploration tasks with sparse rewards. It focuses on stage before agent gets first positive reward, which case, traditional RL algorithms simple strategies often work poorly. Unlike previous methods using some attribute a single state as intrinsic reward to encourage exploration, this leverages social influence between different states permit more efficient exploration. introduces general construction method evaluate dynamically. Three kinds are introduced for state: conformity, power, and authority. By measuring state’s influence, agents quickly find focus during process. The proposed framework evaluation works well task. Extensive experimental analyses comparisons Grid Maze many Atari 2600 games demonstrate its high efficiency.
منابع مشابه
Parallel Shared-Memory State-Space Exploration in Stochastic Modeling
Stochastic modeling forms the basis for analysis in many areas including biological and economic systems as well as the per formance and reliability modeling of computers and communication net works One common approach is the state space based technique which starting from a high level model uses depth rst search to generate both a description of every possible state of the model and the dynami...
متن کاملState Exploration with Multiple State Groupings
Exploration algorithms are relevant to the industrial practice of generating test cases from an abstract state machine whose runs define the predicted behavior of the software system under test. In this paper we describe a new exploration algorithm that allows multiple state grouping functions to simultaneously guide the search for states that are interesting or relevant for testing. In some ca...
متن کاملThe Bayesian Echo Chamber: Modeling Social Influence via Linguistic Accommodation
We present the Bayesian Echo Chamber, a new Bayesian generative model for social interaction data. By modeling the evolution of people’s language usage over time, this model discovers latent influence relationships between them. Unlike previous work on inferring influence, which has primarily focused on simple temporal dynamics evidenced via turn-taking behavior, our model captures more nuanced...
متن کاملReliable Modeling of Ideal Generic Memristors via State-Space Transformation
The paper refers to problems of modeling and computer simulation of generic memristors caused by the so-called window functions, namely the stick effect, nonconvergence, and finding fundamentally incorrect solutions. A profoundly different modeling approach is proposed, which is mathematically equivalent to windowbased modeling. However, due to its numerical stability, it definitely smoothes th...
متن کاملSteady-state parameter sensitivity in stochastic modeling via trajectory reweighting.
Parameter sensitivity analysis is a powerful tool in the building and analysis of biochemical network models. For stochastic simulations, parameter sensitivity analysis can be computationally expensive, requiring multiple simulations for perturbed values of the parameters. Here, we use trajectory reweighting to derive a method for computing sensitivity coefficients in stochastic simulations wit...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Proceedings of the ... AAAI Conference on Artificial Intelligence
سال: 2021
ISSN: ['2159-5399', '2374-3468']
DOI: https://doi.org/10.1609/aaai.v35i9.16981